The Relationship between Codon Boundaries and Multiple Reading-Frame Preferences: Coding Organization of Bacterial Insertion Sequences
Abstract
Theoretical considerations have shown that the five possible overlapping reading-frame configurations differ significantly in their coding flexibility and thus in their information content (Siegel and Fitch 1980; Smith and Waterman 1980). Contrary to expectation, the overlapping frame configuration allowing the greatest coding flexibility is rarely seen, whereas one of the most constraining is common. We point out here that this overlapping reading-frame paradox is intimately linked to an observed but unexplained preference in coding regions for a pyrimidine-purine pair at codon boundaries (Shepherd 1981; Jones and Kafatos 1982; Smith et al. 1983). The codon boundary preference, which may be related to translation efficiency or accuracy, places constraints on the evolution of overlapping coding regions. These considerations may help identify actual coding regions in DNA sequences. We have analyzed five sequenced (enteric) bacterial insertion sequences for codon boundary incidences and reading-frame configurations and find that they are consistent with these proposed constraints.
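As a rough illustration of what counting codon boundary incidences could look like, the sketch below tallies how often a pyrimidine in the third codon position is followed by a purine in the first position of the next codon, for each of the three forward reading frames. It is not the authors' procedure; the function names `boundary_yr_fraction` and `yr_by_frame` are hypothetical, and the handling of ambiguous bases (skipped) is an assumption made here for simplicity.

```python
PURINES = frozenset("AG")
PYRIMIDINES = frozenset("CT")
BASES = PURINES | PYRIMIDINES


def boundary_yr_fraction(seq, frame=0):
    """Fraction of codon boundaries in `frame` that are pyrimidine|purine.

    A boundary lies between the third base of one codon and the first
    base of the next. Boundaries touching a non-ACGT symbol are skipped
    (an assumption of this sketch).
    """
    seq = seq.upper().replace("U", "T")
    hits = total = 0
    # i indexes the third base of each codon in the chosen frame.
    for i in range(frame + 2, len(seq) - 1, 3):
        left, right = seq[i], seq[i + 1]
        if left in BASES and right in BASES:
            total += 1
            if left in PYRIMIDINES and right in PURINES:
                hits += 1
    return hits / total if total else 0.0


def yr_by_frame(seq):
    """Pyrimidine|purine boundary fraction for forward frames 0, 1, 2."""
    return [boundary_yr_fraction(seq, f) for f in range(3)]


if __name__ == "__main__":
    # Toy sequence of repeated GCT codons: every frame-0 boundary is T|G.
    print(yr_by_frame("GCTGCTGCTGCT"))
```

A frame whose fraction stands well above the other two would be the candidate coding frame under the preference described above; for overlapping genes on the same strand, two frames cannot both place a pyrimidine-purine pair at every boundary, which is the kind of constraint the abstract refers to.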
Related resources
Prokaryotic Genome Annotation Pipeline
The process of annotating prokaryotic genomes includes prediction of protein-coding genes, as well as other functional genome units such as structural RNAs, tRNAs, small RNAs, pseudogenes, control regions, direct and inverted repeats, insertion sequences, transposons, and other mobile elements. Bacterial and archaeal genomes have the considerable advantage of usually lacking introns, which subs...
Introns and reading frames: correlation between splicing sites and their codon positions.
Computer analyses of the entire GenBank database were conducted to examine correlation between splicing sites and codon positions in reading frames. Intron insertion patterns (i.e., splicing site locations with respect to codon positions) have been analyzed for all of the 64 codons of all the eukaryote taxonomic groups: primates, rodents, mammals, vertebrates, invertebrates, and plants. We found...
The codon preference plot: graphic analysis of protein coding sequences and prediction of gene expression
The codon preference plot is useful for locating genes in sequenced DNA, predicting the relative level of their expression and for detecting DNA sequencing errors resulting in the insertion or deletion of bases within a coding sequence. The three possible reading frames are displayed in parallel along with the open reading frames and plots of the location of rare codons in each reading frame.
Translation of the F protein of hepatitis C virus is initiated at a non-AUG codon in a +1 reading frame relative to the polyprotein
The hepatitis C virus (HCV) genome contains an internal ribosome entry site (IRES) followed by a large open reading frame coding for a polyprotein that is cleaved into 10 proteins. An additional HCV protein, the F protein, was recently suggested to result from a +1 frameshift by a minority of ribosomes that initiated translation at the HCV AUG initiator codon of the polyprotein. In the present ...